SYSTEM AND METHOD FOR OPTIMIZING COLLECTION FROM SKIP ACCOUNTS 



Field of the Invention 

The present invention relates to data processing systems, 
5 and in particular, to systems that optimize collection of 
accounts receivables by using predictive models. 



Background of the Invention 

A financial institution such as a credit card issuer loses 
ffl3 millions of dollars from uncollectable accounts of card users. 
*0 In many cases, card users simply "skip town" and disappear, 

making them difficult to locate. An account is called a skip 
y account if there is no valid contact information of the account 

holder such as a telephone number. 
l[5 One traditional industry approach to collect on skip 

q accounts is to send them to a skip account tracer (database 

vendor) such as LEXIS-NEXIS or Equifax to obtain contact 

information in an attempt to locate the accounts. Generally, 

the tracers return telephone numbers of the account holders. 
20 The skip accounts with telephone numbers are then given to 

collection agents to be worked with the assumption that the 

obtained telephone numbers are valid. 

One problem with this approach is that the numbers obtained 

are often incorrect. Moreover, even if the numbers are correct, 
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the card holders often refuse to pay. These factors 
substantially drive up the cost of collecting from the skip 
accounts. Some card issuers give up even trying to locate the 
accounts and simply turn them over to a collection agency. 

Therefore, there is a need for an improved system and 
method for optimizing collection of outstanding balances from 
the skip accounts. 

Summary of the Invention 

According to the present invention, a system for optimizing 
collection of money from skip accounts is provided. The system 
includes a processor operable to execute programs instructions 
and a memory coupled to the processor. A predictive model 
stored in the memory is associated with an account tracing 
entity. The model processes data of a skip account to generate 
an output that indicates an expected recovery amount from the 
skip account. An analysis program stored in the memory 
determines a course of action based on the output of the 
predictive model . 

In one embodiment, the system includes a different 
predictive model for each tracing entity. The predictive models 
are applied to the skip account data to determine the expected 
recovery amounts for all tracing entities. The skip account is 
then sent to the tracing entity associated with the highest 



expected recovery amount as the optimal course of action to 
take. As can be appreciated, the system maximizes the expected 
recovery amount from skip accounts by evaluating the potential 
for recovery from using different tracing entities. 

Brief Description of the Drawings 

FIG. 1 is a block diagram of one implementation of a system 
in accordance with the present invention. 

FIG. 2 is a block diagram showing a functional architecture 
of the present invention. 

FIG. 3 is a flow chart of a method of analyzing a skip 
account for optimal collection according to the present 
invention. 

FIG. 4 is an exemplary predictive model using a set of 
equations to analyze the skip account according to the present 
invention. 

FIG. 5 is an exemplary predictive model related to the 
likelihood of locating a skip account through a selected tracing 
entity according to the present invention. 

FIG. 6 is an exemplary predictive model related to the 
liquidation rate of a skip account according to the present 
invention. 



FIG. 7 is an exemplary list of variables that may be used 
to develop the predictive models according to the present 
invention. 

Fig. 8 is an exemplary table illustrating degradation 
5 factors. 



Detailed Description of the Invention 

FIG. 1 shows a block diagram of one implementation of a 
system 100 in accordance with the present invention. Tracers 
ffO database 102 and accounts database 104 are connected to the 
43 system 100 through a conventional data network 106. The data 
0" 1 network can be a public network such as the Internet or a 
y private network. The accounts database 104 stores such customer 
f e data as customer profiles and past transactions as shown in FIG. 
J5 7. The tracers database 102 stores information related to the 
O past success or failure of locating skip accounts for various 
account tracing entities (also known as "tools") which are 
generally database vendors, private investigators or CM 
(continuous monitoring) tools. It should be understood that 
20 data as used herein means either a single data element or 
multiple data elements. 

The system 100 includes a processor 108, I/O interface 110 
connected to the data network 106, output device 112 and memory 
114 which are all connected to each other through a common bus 
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116. The processor 108 runs software program instructions 
stored in the memory 114. The memory 114 contains programs such 
as an analysis program 200 of FIG. 3, temporary data 120 and 
predictive models 122 that have been developed for the tracing 
5 entities. According to the software program instructions, the 
processor 108 stores the data obtained from the data network 106 
in the temporary data 120. The processor 108, temporary data 
120, programs 200 and predictive models 122 operate together to 
generate an output to the output device 112. The generated 
f{) output is indicative of an expected recovery amount from the 
y3 skip account from using a particular tracing entity to locate 
CP the account. In a preferred embodiment, a predictive model is 

developed for each tracing entity and the outputs of all 
f s predictive models are compared against each other to determine 
W5 an optimal course of action as will be explained in detail later 
□ herein. 

Referring now to FIG. 2, an overall functional architecture 
of the system 100 is shown. The system 100 includes two 
components: model development component 13 0 and model processing 
20 component (analysis program) 200. The model development 

component 13 0 uses past data 134 stored in the tracers and 
accounts databases 102, 104 to build a predictive model 136 for 
each tracing entity. In one embodiment, there are twelve 
different tracing entities. Thus, twelve predictive models are 
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developed by the model development component 13 0. Although a 
regression based predictive model is used in the embodiment 
shown, any type of predictive modeling technique such as neural 
network modeling may be used. 
5 The analysis program component 200 receives data 134 of a 

skip account being evaluated and applies the predictive model 
136 against the data. The output of the analysis program 200 is 
evaluated to determine a proper course of action to take as will 
be explained later in more detail with reference to FIG. 3. 
ED An exemplary predictive model is shown in FIG. 4. The 

predictive model in FIG. 4 includes four models: P (Locate), 
?! LR (Found), LR (Not Found) and LR(BAU). The P (Locate) model 
ii predicts a percentage likelihood of finding a skip account using 
L. a corresponding tracing entity. The LR (Found) model predicts a 
M net liquidation rate if a search is done and the account 
O (account user) is found. The liquidation rate is preferably in 
a range of 0 to 1 . The liquidation rate, however, can be less 
than 0 or greater than 1. The LR (Not Found) model predicts a 
net liquidation rate if a search is done, but the account is not 
20 found. In an exemplary embodiment, if the account is not found, 
the predicted net liquidation rate may change since data 
associated with the account is updated before rerunning the data 
through the respective model. In an alternative embodiment, the 
LR (Found) model and the LR (Not Found) model are changed if 



another search is done using a different tracing entity. The 
LR(BAU) model predicts a net liquidation rate if no search is 
performed and the skip account is instead sent to a collection 
agency. These four models will be explained in detail later 
5 herein. 

As shown in equation (1) of FIG. 4, the predictive model's 
output value E (value) is equal to Revenue (Skip) minus Revenue 
(Agency) where Revenue (Skip) represents a net revenue expected 
to be collected from a skip account if a corresponding account 
© tracing entity is used to locate the account, and Revenue 
3 (Agency) represents a net revenue expected to be collected from 
5 1 the skip account if no search is performed to locate the account 
^ and it is instead sent to a collection agency. 
L Equations (2) and (3) through (8) show more detailed 

tfc breakdowns of equation (1). In equation (2), Revenue (Found) 
O represents a revenue amount expected to be collected if the 

account tracing entity correctly locates the skip account. It 
is determined by multiplying the balance or accounts receivable 
Bal, output of the P (Locate) model, and output of the LR (Found) 
20 model together as shown in equation (3) . For example, if $500 
balance is owed on the skip account, there is a 30% chance that 
the corresponding tracing entity can locate the account, and the 
liquidation rate if found is 50%, then the Revenue (Found) is 
$75. 
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Revenue (Not Found) represents a revenue amount expected to 
be collected if the account tracing entity fails to locate the 
current skip account. The amount is usually greater than zero 
since many skip accounts do pay at least a portion of the 
5 balance on the account even if no special effort is made to 
collect on the account. Revenue (Not Found) is determined by 
multiplying the balance Bal, output of (1- P (Locate)) which is a 
percentage likelihood of failing to locate the skip account 
through the tracing entity, and LR (Not Found) together as shown 
3D in equation (4) . Continuing with the same example, if LR (Not 

Found) is 10%, then Revenue (Not Found) is $35. 
J] cost (Search) represents a cost of locating the skip 

y account through the account tracing entity. It is determined by 
L Cost (Tool) plus Cost (Verify) as shown in equation (5) . Cost 
3=5 (Tool) represents the amount the tracing entity charges which is 
Q usually a fixed fee per account. If the tracing entity is a 
private investigator, however, payment is made only for 
successfully located accounts. Thus, the cost of the tool is 
the fee charged multiplied by the percentage probability of 
20 success P (Locate) . Cost (Verify) represents the cost of 

verifying the accuracy of the locate information derived from 
the tracing entity. Typically, the locate information received 
from the tracing entity includes a telephone number of the 
account holder. In that case, the cost of verification is the 
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cost associated with calling the number. With the same example 
above, if the tracing entity used charges $1 for each locate and 
it costs $2.50 to verify, Cost (Search) is $3.50. 

Cost (Collect Skip) represents a cost of collecting from 
the skip account after the account is sent to a tracing entity, 
but regardless of whether the account is accurately located or 
not. As shown in equation (6), it is determined by Revenue 
(Found) times Comm (Collect Internal) , which is an internal cost 
of collection, plus Revenue (Not Found) times Comm (Collect 
External) , which is an external cost of collection. Comm 
(Collect Internal) represents the card issuer's internal 
commission rate of collection. In one embodiment, the 
commission is a fixed percentage of amount collected. Comm 
(Collect External) represents the issuer's external commission 
rate of collection if it were to outsource the account to a 
collection agency after having failed to locate the account 
through the tracing entity. Continuing with the same example 
above, if the internal and external commission rates are 12%, 
Cost (Collect Skip) is $9 plus $4.20 which is a total of $13.20. 

Revenue (BAU) , where BAU stands for business as usual, 
represents a revenue amount expected to be collected if the 
account is sent to a collection agency with no action being 
taken to locate the account through the account tracing entity. 
As shown in equation (7) , it is determined by multiplying the 



output of the LR(BAU) model by the balance on the account. 
Continuing with the same example above, if LR(BAU) outputs .2 
(20%), then Revenue (BAU) is $100. 

Cost (Collect BAU) represents a cost of collecting from the 
skip account if the account is sent to a collection agency with 
no action being taken to locate the account through the account 
tracing entity. As shown in equation (8), it is determined by 
multiplying Revenue (BAU) by Comm (Collect External). Continuing 
with the same example above, Cost (Collect BAU) is $12. 

The example used above yields a final E (value) of $5.30. 
This means that it is more efficient to send the skip account to 
a tracing entity rather than send the skip account to a 
collection agency as will be discussed in more detail with 
reference to FIG. 3. 

Referring now to FIG. 5, an example P (Locate) model for one 
tracing entity is illustrated. In the embodiment shown, the 
P (Locate) models are logistic regression models. These models 
are used because the input variables are both continuous and 
discrete and the desired output is discrete. As is well-known, 
these types of models are fed a specific data set, which is used 
to predict the percent likelihood of a given event occurring. 
The result of the model is given in the form of a percentage, 
between 0 and 1 . 
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As discussed earlier, the P (Locate) model predicts a 
percentage probability of locating a given account using a given 
tracing entity. Knowing whether one tracing entity is, for 
example, 2% or 56% likely to find an account is important 
5 information in determining whether it is efficient to send a 
skip account to that tracing entity. 

To develop the P (Locate) model for each tracing entity, 
accounts data sets involving variables such as shown in FIG. 7 
and the tracers data are processed. In the embodiment shown, 
10 one P (locate) model is developed for one corresponding tracing 
P. entity. By separating the models among the different tracing 
5 entities, each model uses only the data elements which are 
S relevant to it, and it can weight each piece of information 

'•as? S 

u differently. Also, different P (Locate) models can "bin" 
i*5 information differently. For example, while two models may both 
O use the FICO score to predict location, one model may attribute 
negative probability to accounts whose FICO score is less than 
500 and the other model may attribute positive probability to 
accounts whose FICO score is greater than 550. Other models may 
20 use different balance cuts. As can be appreciated by persons 
skilled in the art, the use of separate models for different 
tracing entities provides a much more accurate prediction. 

Still referring to FIG. 5, the P (Locate) model has 3 
columns. The first column contains the variable names, the 
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third column contains the variable conditions, and the second 
column contains the parameter values to be added if the variable 
conditions are w true /; . The Intercept is always added to the 
model output calculation. The output is then determined by the 
5 following logistic regression equation (9) . 

P (Locate) = 1 / (1 + e" x ) (9) 
where x represents the addition of the Intercept and all 
other parameters whose corresponding variable conditions are 
u true" . 

WO For example, for a particular skip account, if the Days 

^ Since Address Change is greater than 125 days, the State of 
£l Residence is a New England State, and Balance was greater than 
ff§ $500, then the addition of the intercept and all other 
y, parameters for true conditions is -2.0088 and P (Locate) is 
]45 determined to be 0.118. That means there is an 11.8% likelihood 
C that the account will be located by the corresponding tracing 
entity. 

Referring to FIG. 6, an example Revenue (Found) model is 
illustrated. In the embodiment shown, the model is based on 
20 CHAID (Chi-square Automatic Interaction Detection) . The Revenue 
(Found) model predicts a net liquidation rate of an account over 
time if a search is done and the account is found. In one 
embodiment, the rate is determined by the sum of (payments made 
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- (purchases + cash advances + balance transfers) ) divided by 
the balance on the account at the time the model processes the 
account . 

To develop the Revenue (Found) model, an accounts data set 
5 stored in accounts database 104 involves variables such as shown 
in FIG. 7. Alternatively, tracers data stored in tracers 
database 102 can also be used with the accounts data. In the 
embodiment shown, one Revenue (Found) model is developed for use 
by all corresponding tracing entities in the formula E (value) in 
30 FIG. 4. 

^ The Revenue (Found) CHAID model generally works as decision 

%l trees. Each branch of the tree represents a different condition 
71 (for example, if the balance is greater than or less than $500) . 
!«& At each decision point, there is an expected LR. Also, at the 
145 end of each branch there are "nodes" which list an expected LR. 
O An account that meets the conditions of the particular node 

3 s 
sss;; 

would have that expected LR. The nodes are mutually exclusive 
and collectively exhaustive. If for some reason some accounts 
do not have the data element available, the LR at the last node 
20 for which all of the needed information is available is used. 
Still referring to FIG. 6, if the balance on the skip 
account is greater than $1,500 and the FICO score is greater 
than or equal to 400, then the output of the Revenue (Found) 
model is .511 or 51.1%. 
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The Revenue (Not Found) and Revenue (BAU) models are also 
CHAID models that are developed in a similar manner as the 
Revenue ( Found ) mode 1 . 

Once all of the predictive models 136 are developed, they 
are stored in the memory 114 at 122 for fast access by the 
analysis program 200. Referring to FIG. 3, the program in step 
202 receives the customer data stored in database 104 including 
customer profile data such as ACCTTYP, CREDIT LIMIT and STATE 
(see FIG. 7 for definition) , and transaction data such as 
BALANCE, LASTPMT and TOTPAY12 . In step 204, the program 200 
applies the received data to the predictive model associated 
with a selected tracing entity to generate an output E (value) . 
The output E (value) represents an expected recovery amount from 
the skip account being processed. The output is then stored in 
the memory 114 at 120 in step 206. Alternatively, the output 
can be sent to the output device 112. 

In step 2 08, it is determined whether there are more 
tracing entities to process. If YES, control passes to step 210 
where the next tracing entity is selected and steps 204 and 2 06 
are repeated for each tracing entity. If the decision at 208 is 
NO, then there are no more tracing entities to process and the 
program 200 continues with step 212. At 212, the program 
evaluates all outputs of the predictive models and selects the 
highest output . 
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In step 214, it is determined whether the selected output 
is negative, meaning that the expected recovery amount is 
negative even from using the best tracing entity for the 
particular skip account being processed. If YES, then control 

5 passes to step 216 where the account is sent to a collection 

agency as the proper course of action to take. If the decision 
is NO, then the program continues with step 218 where the skip 
account is sent to the tracing entity associated with the 
highest output in order to locate the account. In step 22 0, the 

IS location data, typically including a telephone number of the 

y3 account holder, is received from the tracing entity for 

u* verification. 

^ In step 222, it is determined whether the location data 

^ returned is accurate. This step typically involves calling the 
B telephone number to verify that the number belongs to the proper 
□ account holder. If the location data is determined to be 
inaccurate, control passes to step 2 02 where the entire 
decisioning process (steps 202 through 222) is repeated for the 
remaining tracing entities. Before the redecisioning, the 
20 P (Locate) is adjusted to take into account that the "best" 

tracing entity failed to locate the account. The customer data 
stored at the database 104 are received again because a 
significant amount of time may have passed between the previous 



15 



decisioning process and the next, and many of the data may have 

changed during that time. 

In an exemplary embodiment, P (Locate) is adjusted in the 
following way. A P (Locate) is calculated using updated data by 
5 the same P (Locate) model described above. Next, the P (Locate) 
is multiplied by a degradation factor. The degradation factor 
is determined, for example, during past testing for each vendor. 
The degradation factor estimates the reduction in P (Locate) for 
each tool, given that 1, 2, 3, 4 ...or 11 tools have attempted to 
IB locate an account, but failed. If more tools are added, then 
C the values of the degradation factors would have to be 
?! recalculated. The degradation factors are stored in a table, 
S for example, a 11 X 11 table, which the models use to look up 
u the appropriate values, as shown in Fig. 8. The degradation 
2 factors illustrated in Fig. 8 are merely exemplary. For 
Q example, if an account was run through a P (Locate) model of tool 
C and received a value of .35 and the account had already been 
sent unsuccessfully to tools A and B, then the number of tools 
sent to previously would be two. Thus, referring to the table 
20 shown in Fig. 8, the .35 P (Locate) value would be adjusted by 

multiplying the value by the .41 degradation factor since tool C 
was used and the account had already been sent to two tools. 
The final P (Locate) would be .35 X .41 which equals .1435. 
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If the location data is determined to be accurate in step 
222, then control passes to step 224 where the account is sent 
to the card issuer's internal collection department for 
collection. In one embodiment, an account verifier calls the 
5 telephone number obtained from the tracing entity. If the 

number is verified to be accurate, the call is transferred to an 
internal collection agent to be worked. The program then ends 
at step 226. 

From the foregoing, it will be appreciated that, although 
M specific embodiments of the invention have been described herein 
* for purposes of illustration, various modifications may be made 
J without deviating from the spirit and scope of the invention. 
| For example, although the illustrated predictive model includes 
L four different models, the present invention can operate with 
3 one or more models such as a system based only on the 
0 probability model outputting a likelihood of locating a skip 
account by a tracing entity. In that case, the skip account 
will be sent to the tracing entity associated with the highest 
probability. Accordingly, the present invention is not limited 
20 except as by the appended claims. 
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